
Determining the Best K for Clustering Transactional Datasets: A Coverage Density-based Approach

Abstract

Determining the optimal number of clusters is an important but poorly understood problem in cluster analysis. In this paper, we propose a novel method for finding a set of candidate optimal cluster numbers (Ks) in transactional datasets. Concretely, we propose the Transactional-cluster-modes Dissimilarity, based on the concept of coverage density, as an intuitive inter-cluster dissimilarity measure for transactional data. Based on this measure, we develop an agglomerative hierarchical clustering algorithm, and the Merge Dissimilarity Indexes generated during hierarchical cluster merging are used to find the candidate optimal Ks for transactional data. Our experimental results on both synthetic and real data show that the new method often estimates the number of clusters of transactional data effectively.
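
The abstract does not define its measures, so the following is only a minimal sketch of the general idea, not the paper's algorithm. It assumes coverage density means the fraction of filled cells in a cluster's transaction-by-item matrix, and it uses a stand-in merge cost (the loss of size-weighted coverage density caused by a merge) in place of the Transactional-cluster-modes Dissimilarity. The names `coverage_density`, `merge_cost`, `merge_history`, and `candidate_ks` are hypothetical, and transactions are assumed to be non-empty.

```python
from itertools import combinations


def coverage_density(cluster):
    """Assumed definition: filled cells / (transactions * distinct items)."""
    items = set().union(*cluster)
    filled = sum(len(t) for t in cluster)
    return filled / (len(cluster) * len(items))


def merge_cost(a, b):
    """Stand-in dissimilarity (an assumption, not the paper's measure):
    size-weighted coverage density lost when clusters a and b are merged."""
    merged = a + b
    return (len(a) * coverage_density(a)
            + len(b) * coverage_density(b)
            - len(merged) * coverage_density(merged))


def merge_history(transactions):
    """Agglomerative merging from singletons down to one cluster.

    Returns a list of (k, cost) pairs, where cost is the cost of the
    cheapest merge available when k clusters remain.
    """
    clusters = [[frozenset(t)] for t in transactions]
    history = []
    while len(clusters) > 1:
        # Pick the pair of clusters that is cheapest to merge.
        i, j = min(combinations(range(len(clusters)), 2),
                   key=lambda p: merge_cost(clusters[p[0]], clusters[p[1]]))
        history.append((len(clusters), merge_cost(clusters[i], clusters[j])))
        merged = clusters[i] + clusters[j]
        clusters = [c for n, c in enumerate(clusters) if n not in (i, j)]
        clusters.append(merged)
    return history


def candidate_ks(history, top=3):
    """Rank k by how sharply the merge cost rises compared with the
    previous merge; a sharp rise suggests k clusters was a natural stop."""
    jumps = [(history[n][1] - history[n - 1][1], history[n][0])
             for n in range(1, len(history))]
    jumps.sort(reverse=True)
    return [k for _, k in jumps[:top]]


if __name__ == "__main__":
    # Two obvious groups of identical transactions.
    data = [{"a", "b", "c"}] * 3 + [{"x", "y", "z"}] * 3
    print(candidate_ks(merge_history(data), top=1))  # prints [2]
```

Recording one cost per merge step mirrors the role the abstract gives to the Merge Dissimilarity Indexes: the sequence is inspected after clustering finishes, so several candidate Ks can be read from a single run instead of re-clustering once per K.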
